CONFORMAL WASSERSTEIN DISTANCES: COMPARING SURFACES IN 

POLYNOMIAL TIME 



Y. LIPMAN, I. DAUBECHIES 

Abstract. We present a constructive approach to surface comparison realizable by a polynomial-time 
algorithm. We determine the "similarity" of two given surfaces by solving a mass-transportation problem 
between their conformal densities. This mass transportation problem differs from the standard case in that 
we require the solution to be invariant under global Mobius transformations. We present in detail the case 
where the surfaces to compare are disk-like; we also sketch how the approach can be generalized to other 
types of surfaces. 



1. INTRODUCTION 

Alignment and comparison of surfaces (2-manifolds) play a central role in a wide range of scientific disciplines; 
they often constitute a crucial step in a variety of problems in medicine and biology. 

Mathematically, the algorithmic problem of surface alignment amounts to defining a metric function d(-, •) 
in the space of Riemannian 2-manifolds with the following two properties: 1) for any two surfaces M and TV, 
d(M,J\f) = implies that M. and M are isometric, and 2) given a reasonably large number of reasonably 
well-distributed sample points on both surfaces, an accurate approximation to the distance d(M,Af) can be 
calculated in a time that grows only polynomially in the sample set size. This second requirement is crucial 
to ensure that the algorithm can be used effectively in applications. 

A prominent mathematical approach to define distances between surfaces that has been proposed for practical 
applications [TOl IH] 1S the Gromov-Hausdorff (GH) distance; it considers the surfaces as special cases of 
metric spaces. To determine the GH distance between the metric spaces X and Y, one examines all the 
isometric embeddings of A and Y into (other) metric spaces; although this distance possesses many attractive 
mathematical properties, it is inherently hard computationally. For instance, computing the L p version of the 
GH distance between two surfaces is equivalent to a non-convex quadratic programming problem, generally 
over the integers [5]. This problem is equivalent to integer quadratic assignment, and is thus NP-hard }13j . 
In [S], Memoli generalizes the GH distance of [TUJ by introducing a quadratic mass transportation scheme 
to be applied to metric spaces that are also equipped with a measure (mm spaces); the computation of 
this Gromov-Wasscrstcin (GW) distance for mm spaces is somewhat easier and more stable to implement 
than the original GH distance. The computation of the GW distance between two surfaces described in [S] 
utilizes a (continuous rather than integer) quadratic programming method; the functional to be minimized is 
generally not convex and optimization methods are likely to find local minima rather than the global minima 
that realizes the surfaces' distance. 

In this paper we propose a new surface alignment procedure, introducing the conformal Wasserstein distance. 
Our construction consists in "geometrically" aligning the surfaces, based on uniformization theory and opti- 
mal mass transportation. The uniformization theory serves as a "dimensionality reduction" tool, representing 
e.g. a disk-type surface by its conformal factor on the unit disk: the corresponding automorphism group (the 
disk-preserving Mobius group) has only three degrees of freedom and is therefore searchable in polynomial 
time. Next, the Kantorovich mass-transportation [3] is used to construct a linear functional the minimizer 
of which furnishes a metric; as is well-known [21 [4], this can solved by a linear program, and can thus be 
computed/approximated in polynomial time as- well. 
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As far as we know, prior to our work, no polynomial time algorithm was known to compute, either exactly 
or up to a good approximation, the GH distance or any other proposed intrinsic geometric distances between 
surfaces. Although |10j uses a mass transportation as well (albeit quadratic mass transportation), our ap- 
proach is nevertheless different. We solve the "standard" (and thus linear) Kantorovich mass transportation 
problem, which is convex (even linear) and solvable via a linear programming method. 

There exist earlier papers on aligning or comparing surfaces that use uniformization. In particular, the 
papers by Zeng et al. [?, ?] which build upon the work of Gu and Yau [12], also use uniformization 
for surface alignment (albeit without defining a distance between surfaces). However, they use prescribed 
feature points (defined either by the user or by extra texture information) to calculate an interpolating 
harmonic map between the uniformization spaces, and then define the final correspondence as a composition 
of the uniformization maps and this harmonic interpolant. We use only intrinsic geometric information: we 
make use of the surfaces' metric (inherited from its embedding in M 3 ) and the induced conformal structure 
to define deviation from (local) isometry. 

Optimal mass transportation has also been used before in aligning or comparing images. Following the 
seminal work by Rubner et al. [3J, it is used extensively in the engineering literature to define interesting 
metric distances for images, interpreted as probability densities; in this context the metric is often called the 
" Earth Mover's Distance" . 

Our paper is organized as follows: in Section[2]we briefly recall some facts about uniformization and optimal 
mass transportation that we shall use, at the same time introducing our notation. Section|3]contains the main 
results of this paper, constructing the conformal Wasserstein distance metric between disk-type surfaces, in 
several steps; we also indicate how the approach can be generalized to other surfaces. Section [4] briefly 
describes the discrete case. 



As described in the introduction, our framework makes use of two mathematical theories: uniformization 
theory, to represent the surfaces as measures defined on a canonical domain, and optimal mass transportation, 
to align the measures. In this section we recall some of their basic properties, and we introduce our notations. 

2.1. Uniformization. By the celebrated uniformization theory for Riemann surfaces (see for example [3[H]), 
any simply-connected Riemann surface is conformally equivalent to one of three canonical domains: the 
sphere, the complex plane, or the unit disk. Since every 2-manifold surface M equipped with a smooth 
Riemannian metric g has an induced conformal structure and is thus a Riemann surface, uniformization 
applies to such surfaces. Therefore, every simply-connected surface with a Riemannian metric can be mapped 
conformally to one of the three canonical domains listed above. In this paper, we discuss 2D surfaces, 
equipped with a Riemannian metric tensor g (possibly inherited from the standard 3D metric if the surface 
is embedded in M 3 ) that have a finite total volume (i.e. area, since we are delaing with surfaces). For 
convenience, we shall normalize the metric so that the surface area equals 1. We shall discuss in detail the 
case where the surfaces A4 are topologically equivalent to disks. (We shall address in side remarks how the 
approach can be extended to the other cases.) For each such M. there exists a conformal map <fi : A4 — >• T>, 
where T> = {z ; \z\ < 1} is the open unit disk, (we assume that M. does not include its boundary, if it has 
one). The map </> pushes g to a metric on T>; denoting the coordinates in T> by z — x 1 + ix 2 , we can write 
this metric as 



where Ji{z) > 0, Einstein summation convention is used, and the subscript * denotes the "push-forward" 
action. The function Jl can also be viewed as the density function of the measure voLvi induced by the 
Riemann volume element: indeed, for (measurable) A C M., 



2. Background and Notations 



g — <f>*g = n(z) Sij dx 1 ® dx° , 



(2.1) 
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It will be convenient to use the hyperbolic metric (1 — \z\ 2 )~ 2 8ijdx l ®dx^ as the reference metric on the unit 
disk, rather than the standard Euclidean Sijdx 1 ® dx 3 ] note that the two are conformally equivalent (with 
conformal factor (1 — |z| 2 )~ 2 ). Instead of the density Jl(z), we shall therefore use the hyperbolic density 
function 

(2.2) M ff (z):=(l-|z| 2 ) 2 M(z), 

where the superscript H stands for hyperbolic. We shall often drop this superscript: unless otherwise stated 
fi = fi H , and v = v H . This density function n satisfies 



voIm{A) = / fi(z) dvol H (z) , 

J<t>(A) 

where dvolniz) = (1 — |z| 2 ) -2 dx 1 A dx 2 . In what follows we shall use the symbol \i both for the function /u, H 
and as a shorthand for the absolutely continuous measure voL» , and by extension for the surface M itself. 

The conformal mappings of T> to itself are the disk-preserving Mobius transformations m £ M-p , a family 
with three real parameters, defined by 

(2.3) m(z) = e ie ^^, «eD, 9 G [0, 2tt). 

1 — az 

Since these Mobius transformations satisfy 

(2.4) (lHm(z)|T 2 |m'(z)| 2 = (l-MT 2 , 

where m! stands for the derivatives of m, the pull-back of /i under a mapping m £ Mx> takes on a particularly 
simple expression. Setting w = m(z), with w = y 1 + iy 2 , and g(w) = jl(w)Sijdy' 1 ® dy 3 = fi(w)(l — 
\w\ 2 )~ 2 Sijdy l ® dy 3 , the definition 



implies 



(m*g)(z)kt dx k ® dx e := fi(w) (1 - |w| 2 ) 2 <% dy i ® dy? 



(m*g) ke {z)dx k ®dx e = //(m(z))(l- \m{z)\ 2 ) 2 <% ^ ^- [ dx k ®dx i 
= n{m{z)) (1 - |m(z)| 2 )- 2 |m'(z)| 2 5 M <fc* ® da/ 
= n(m(z)) (1 - |z| 2 )~ 2 4f efe fc cfaf 



In other words, (m*g)(z)ki dx k <S> dx 1 takes on the simple form m*fj,(z) (1 — |z| 2 ) -2 5^ cfe fc ® dx £ , with 

(2.5) m*fi(z) = fi(m(z)). 

Likewise, the push-forward, under a disk Mobius transform m(z) — w, of the (diagonal) Riemannian metric 
defined by the density function /i = fi H , is again a diagonal metric, with (hyperbolic) density function 
m*[j,(w) = (m*/i) (w) given by 

(2.6) m*[j,(w) = /i(m^ 1 (w)). 

It follows that checking whether or not two surfaces M. and M are isometric, or searching for (near-) isometries 
between Ai and A/", is greatly simplified by considering the conformal mappings from M., fif to T>: once the 
(hyperbolic) density functions fi and v are known, it suffices to identify m € Mp such that v(m(z)) and 
H(z) coincide (or "nearly" coincide, in a sense to be made precise). This was exploited in [H] to construct 
fast algorithms to find corresponding points between two given surfaces. In the next section we provide a 
precise formalization of this idea using the notion of optimal mass transportation, described in the following 
subsection. 



Comparing Surfaces in Polynomial Time 



4 



2.2. Optimal mass transportation. Optimal mass transportation was introduced by G. Monge pQ, and L. 
Kantorovich [3] . It concerns the transformation of one mass distribution into another while minimizing a cost 
function that can be viewed as the amount of work required for the task. In the Kantorovich formulation, to 
which wc shall stick in this paper, one considers two measure spaces X, Y (each equipped with a er-algcbra), 
a probability measure on each, /i e P(X), v e P(Y) (where P(X), P(Y) are the respective spaces of all 
probability measures on X and Y), and the space II(^, v) of probability measures 7r on X x Y with marginals 
fi and v (resp.), that is, for A C X, B c Y, tt(A x Y) = (J,(A) and n(X x B) = v(B). The optimal mass 
transportation is the element of II(/i, v) that minimizes J XxY d{x, y)dit(x 1 y), where d(x, y) is a cost function. 
(In general, one should consider an infimum rather than a minimum; in our case, X and Y are compact, 
d(-, •) is continuous, and the infimum is achieved.) The corresponding minimum, 

(2.7) T d {n,v)= inf / d(x,y)dn(x,y), 

7ren(M,i>) JxxY 

is the optimal mass transportation distance between \i and v, with respect to the cost function d(x,y). 

Intuitively, one can interpret this as follows: imagine being confronted with a pile of sand on the one hand 
(corresponding to fj,), and a hole in the ground on the other hand (—u), and assume that the volume of the 
sand pile equals exactly the volume of the hole (suitably normalized, /i, v are probability measures). You 
wish to fill the hole with the sand from the pile (it 6 n(/i, v)), in a way that minimizes the amount of work 
(represented by J d(x 1 y)dir(x,y), where d(-, •) can be thought of as a distance function). 

In what follows, we shall apply this framework to the density functions /i and v on the hyperbolic disk T> 
obtained by conformal mappings from two surfaces Ai, J\f, as described in the previous subsection. 

The Kantorovich transportation framework cannot be applied directly to the densities /i, v. Indeed, the 
density fj,, characterizing the Riemannian metric on T> obtained by pushing forward the metric on M. via 
the uniformizing map <f> : M — > T>, is not uniquely defined: another uniformizing map cf>' : M. — > T> may 
well produce a different p! . Because the two representations are necessarily isometric (<^ _1 ° <f>' maps M. 
isometrically to itself), we must have //(m(z)) = /i(z) for some m £ Mp. (In fact, m = <f)' a In a 

sense, the representation of (disk-type) surfaces M. as measures over V should be considered "modulo" the 
disk Mobius transformations. 

We thus need to address how to adapt the optimal transportation framework to factor out this Mobius 
transformation ambiguity. The next section starts by showing how this can be done. 

3. The conformal Wasserstein framework: optimal volume transportation for surfaces 

We want to measure distances between surfaces by using the Kantorovich transportation framework to 
measure the transportation between the metric densities on T> obtained by uniformization applied to the 
surfaces. The main obstacle is that these metric densities arc not uniquely defined; they arc defined up to a 
Mobius transformation. In particular, if two densities [i and v are related by v = m*/i (i.e. fi(z) — v(m(z))) 1 
where m 6 M-p , then we want our putative distance between /i and v to be zero, since they describe isometric 
surfaces, and could have been obtained by different uniformization maps of the same surface. We thus want 
a distance metric between orbits of the group M-p acting on the conformal factors rather than a metric 
distance between the conformal factors themselves. If we choose a metric distance d on I? that is invariant 
under Mobius transformations, i.e. that it is a multiple of the hyperbolic distance on the disk, then a natural 
definition is as follows 

(3.1) T)(/i, z/)= inf inf / d(z, w) dir(z, w) I . 

As shown in the Appendix, D(/i, v) is indeed a distance between disk- type surfaces; its computation can 
moreover be implemented in running times that grow only polynomially in the number N of sample points 
used in the discretization of the surface (necessary to proceed to numerical computation). The optimization 
over m in the definition of D(/z, v) always achieves its minimum in some m (depending on (i and v of 
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course); denoting this special minimizing m € Mp by m^, we can rewrite D(/x, v) as the result of a single 
minimization (for details, see Appendix): 



D(/x, v) = inf / d(z,w) dn(z,w) 



(3.2) = inf / d(m lit u{z),w) dir(z,w) . 

7rem>,i/) JpxD 

This is, however, purely formal; since the determination of v involves the original double minimization of 



(3.1), a numerical implementation does require solving a mass-transportation functional for many m € Mp. 
In practice, this means that, despite its polynomial running time complexity, the numerical computation of 
TJ(/n, v) is too heavy for many applications, in which all pairwise distances must be computed for a collection 
of that may contain hundreds of surfaces [?]. This seems to lead to an impasse, since there exists no other 
distance metric on T> that is conformally invariant, so that the natural "quotienting operation" over the 
group Mp can produce no other metric than D(-, •). 



However, T)(/i, u), rewritten as in (3.2 ), suggests another way in which we can define an appropriate distance 
metric between orbits of the group Mp acting on the conformal factors. Note that ( |3.2[ ) has exactly the 
same form as for a standard Kantorovich mass transportation scheme, except for the (crucial) difference 
that the cost function depends on /i and v. By retaining the idea of (Kantorovich) mass transportation, 
but allowing the use of cost functions d(-,-) in the integrand that depend on /i and v (without picking 
them necessarily of the form d{m lJL ^{z), w)), we can construct other distance metrics D on the conformal 
factors that are invariant under action of Mp, i.e. for which £>(//, j/) = D(m*[i,v) for all m € Mp. In 
addition, we can pick cost functions of this type ensuring that the distance between (or dissimilarity of) fj, 
and v exhibits some robustness with respect to deviations from global isometry. More precisely, we want the 
distance to be small for surfaces that are not isometric but nevertheless very close to isometric on parts of 
the surfaces; this can be achieved by picking a cost function d^ u (z, w) that depends on a comparison of the 
behavior of fi and v on neighborhoods of z and w, mapped by m ranging over Mp. This cost function, once 
incorporated in the Kantorovich mass transportation framework, will lead to a metric between disk-type 
surfaces (some generic conditions aside) based on solving a single mass transportation problem. The next 
subsection shows precisely how this is done. As is the case throughout the paper, we first give the full details 
of the construction for disk-like surfaces, and then indicate later how to generalize this to e.g. sphere-like 
surfaces. It is worthwhile to note that the "quotient approach" sketched above would not even have been 
applicable in a straightforward way to sphere-like surfaces, since they do not possess a metric invariant under 
all their Mobius transformations. As we shall explain at the end of this section, the same obstruction will 
not exist for the construction introduced in the next subsection. 



3.1. Construction of d^ v (z,w). We construct djj l/ (z,w) so that it indicates the extent to which a neigh- 
borhood of the point z in (T>, fi), the (conformal representation of the) first surface, is isometric with a neigh- 
borhood of the point w in (T>, v) : the (conformal representation of the) second surface. We will need to define 
two ingredients for this: the neighborhoods we will use, and how we shall characterize the (dis)similarity of 
two neighborhoods, equipped with different metrics. 

We start with the neighborhoods. 

For a fixed radius R > 0, we define Q Zo ,r to be the hyperbolic geodesic disk of radius R centered at zq. 
The following gives an easy procedure to construct these disks. If zq = 0, then the hyperbolic geodesic disks 
centered at zq = are also "standard" (i.e. Euclidean) disks centered at 0: fio.i? = { z ', \ z \ < r n}, where 
rji = tanh(i?). The hyperbolic disks around other centers are images of these central disks under Mobius 
transformations (= hyperbolic isometries): setting m(z) = (z — z Q )(l — zz ) _1 , we have 



(3.3) 



Qz M = m 1 (fi ,il) 



Comparing Surfaces in Polynomial Time 



G 



If ml ', m" are two maps in Mx> that both map Zq to 0, then to" o (to') -1 simply rotates f2o,-R around its 
center, over some angle 9 determined by to' and to". From this observation one easily checks that (3.3) holds 
for any m 6 A/p that maps z to 0. In fact, we have the following more general 

Lemma 3.1. For arbitrary z,w € T> and any R > 0, every disk preserving Mobius transformation m € M-p 
that maps z to w (i.e. w = m(z)) also maps CI Zi r to £l Wt R. 

Next we define how to quantify the (dis)similarity of the pairs (Cl Zo ,R > A*) ancl {^w ,R i 17 )■ Since (global) 
isometries are given by the elements of the disk-preserving Mobius group Mp, we will test the extent to 
which the two patches are isometric by comparing (CI WOj r , v ) with all the images of (Q Zo ,R > t 1 ) under Mobius 
transformations in Mx> that take Zq to u>o- 

To carry out this comparison, we need a norm. Any metric gij{z)dx l <8> dx 3 induces an inner product on 
the space of 2-covariant tensors, as follows: if a(z) = a^-(z) dx l <S> dx J and b(z) = cte 1 ® da;- 7 are two 

2-covariant tensors in our parameter space T>, then their inner product is defined by 



(3.4) 



(a(z)Mz)) = <Hi(z)l>kt(z)g tk (z)g> t {z) 



as always, this inner product defines a norm, ||a|| 2 = aij(z) ake(z) g lk (z) g^(z). Let us apply this to the 
computation of the norm of the difference between the local metric on one surface, g%j[z) = fJ,(z)(l — \z\ 2 )~ 2 Sij, 
and hij(w) — u(w)(l — \w\ ) — 5y, the pull-back metric from the other surface by a Mobius transformation 



to. Using (3.4), (2.5), and writing 6, g, h, for the tensors with entries 5u, g$j, and hij, respectively, we have: 
\\g - m*hf z = || ,i(z)(l - |z| 2 )- 2 ^ - v{m{z)){l - \z\ 2 )- 2 8 || 2 



= ( M (z) - v{m{z)))\l - |z| 2 )- 4 S U g ik (z) g^(z) 



= 1- 



v (m(z)) 
H(z) 



For every pair of /x, z/, we are now ready to define the distance function <2„ „(•, •) on 2? 



(3.5) 



inf 

m£M-D , 



I /x(z) - (m,*v)(z) I dvolfl-(z), 



m(z )=w 

where dvolu(z) = (1 — |z| 2 )~ 2 dx A dy is the volume form for the hyperbolic disk. The integral in (3.5) can 
also be written in the following form, which makes its invariance more readily apparent: 

v(m(z)) , , . /' 
Jn, 



(3.6) 



1 



m*h\\ z dvo\jv{ (z), 



where rfvolx(z) = /i(z)(l — |z| 2 ) 2 dx 1 A dx 2 = \f\g~ij\dx 1 A dx 2 is the volume form of the first surface A4. 



The next Lemma shows that although the integration in (3.6) is carried out w.r.t. the volume of the first 
surface, this measure of distance is nevertheless symmetric: 



Lemma 3.2. If m G Af-p maps zq to wq, to(zq) = wq, then 



z ,R 



/i(z) — m*v(z) dvoln(z) 



«0-« 



m*(i(w) — v(w) dvoljj(w) 



Proof. By the pull-back formula (2.5), we have 



/x(z) — to*^(z) dvolff (z) 



n, 



/i(z) — i/(m(z)) dvoljj(z) 



Performing the change of coordinates z = m x (w) in the integral on the right hand side, we obtain 

fj,(m~ (w)) — u(w) dvo\H(w), 
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where we have used that m 1 is an isometry and therefore preserves the volume element dvolufw) = (1 — 
|w| 2 ) -2 dy 1 A dy 2 . By Lemma 3.1 m(f2 Z0! _R.) = Cl WQ g. ', using the push-forward formula (2.6 1 then allows to 



conclude. □ 

Note that our point of view in defining our "distance" between z and w differs from the classical point of 
view in mass transportation: traditionally, d(z, w) is some sort of physical distance between the points z and 
w; in our case d^ v (z,w) measures the dissimilarity of (neighborhoods of) z and w. 

The next Theorem lists some important properties of d^ v ; its proof is given in the Appendix. 

Theorem 3.3. The distance function d3 u (z,w) satisfies the following properties 

(1) ^m*M.m;i'( m r 1 ( 2; o)) m 2 1 { w o)) = d^ u (z ,w ) Invariance under (well-defined) 

Mobius changes of coordinates 

( 2 ) d^ >u (z ,w ) = d^(w ,z ) Symmetry 

(3) d?„(zo,ifo) > Non-negativity 

(4) dp „(zo,uj ) = =>■ Q Zo ,r m (2? , (J,) and £1 WOi r in (T>,v) are isometric 

(5) d%* ViV (mr 1 (z ),Zo) = Reflexivity 

(6) < ,^ 3 ( z ii z 3) < dj} l f t 2 (zi, Z2] + d^ 2413 (z2, z%) Triangle inequality 



In addition, the function dR v :T> xP — > R is continuous. To show this, we first look a little more closely at 
the 1-parameter family of disk Mobius transformations that map one pre-assigned point zq € T> to another 
pre-assigned point wo G 2?. 

Definition 3.4. For any pair of points zo, wq € 2?, we denote by Md,z , Wo the set of Mobius transformations 
that map zq to %. 

This family of Mobius transformations is completely characterized by the following lemma: 

Lemma 3.5. For any zq,wq G T>, the set Mn,zo,w constitutes a 1-parameter family of disk Mobius trans- 
formations, parametrized continuously over S 1 (the unit circle). More precisely, every m G Mu, Zo ,w is of 
the form 

/q ~\ f ■> z-a z -w a l-z^w a 

(0.1) m(z) = t — , with a = a(z , Wo, o) '■= — = and t = t(z ,w , a) := a , 

1 — az 1 — zo wq a 1 — zo wq a 

where a G S\ :— {z G C ; |z| = 1} can be chosen freely. 



Proof. By (2.3 1, the disk Mobius transformations that map zq to all have the form 



z-z ^ ,. , . , . 1 ,.. A _ w + e'^zo 



m^, j 2 (z) = e 1 "^ ^— , the inverse of which is Jn^A (w) = e 



1-zq-z ' <^° v ; l + e-^z^w ' 

where rp G M can be set arbitrarily. It follows that the elements of Md^^ are given by the family 
m~ 1 Wo o m^ )Zo , with ip, 7 G K.. Working this out, one finds that these combinations of Mobius transformations 



take the form (3.7), with a = e^"^ . □ 
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We shall denote by m Zo ^ WOt<T the special disk Mobius transformation defined by (3.7|. In view of our interest 
in dp V , we also define the auxiliary function 

$ : D x D x Si — >C 

(z , w , a) i — > / | n(z) - v{m ZOlWOia {z)) I dvo\ H {z) . 

This function has the following continuity properties, inherited from /x and v: 
Lemma 3.6. 

• For each fixed (zq, Wq), the function $(zo, wq, ■) is continuous on S±. 

• For each fixed a € S\, $(•,-, c) is continuous onV x V. Moreover, the family ($(•,-, a) ) is equicon- 
tinuous. 

Proof. The proof of this Lemma is given in the Appendix. □ 



Note that since S 1 is compact, Lemma 3.6 implies that the infimum in the definition of d^ v can be replaced 
by a minimum: 



du,A z 0, w Q ) = min / | fi(z) - v{m{z)) \ dvol H (z) . 

m{z )=w a Jn zo<R 

We have now all the building blocks to prove 

Theorem 3.7. If fx and v are continuous from T> to R, then d^(z,w) is a continuous function on T> x T>. 
Proof. Pick an arbitrary point (z ,wq) £"D xT>, and pick e > arbitrarily small. 



By Lemma 3.6 there exists a S > such that, for \z' Q — zq\ < S, \w — wq\ < S, we have 

\$(z ,w ,a)-$(z ,w ,o-) \ <e, 
uniformly in a. Pick now arbitrary z' ,w so that \zq — Zq|, \wq — w \ < 5. 

Let m Zo _ WO:<7 , resp. Tn z ',w',cr') be the minimizing Mobius transform in the definition of dj}„(zo, Wq), resp. 
d^ u ( z O' w o) = ®{zo,w ,a) and d^„{z' ,w' Q ) = <J>(z , w , cr') . 

It then follows that 
d^ L ,(z ,w a ) = m.m$(z Q ,w 0l T) < ^(z ,w ,a / ) 

< ®(z ,w ,a') + \$(z ,w ,a') - $(z ,w ,a')\ = d* u { z Oi w o) + l*( z o, w , a') - $(z' ,w ,a')\ 

< d*v( z 'oi w 'o) + SU P \®(zo,w ,u) - $(z' ,w' ,u})\ < d^ v (z' ,w' ) +e . 

Likewise d^ v ( z 'o' w 'o) ^ d^„(z ,w ) +e, so that \d^ u ( z a> w o) ~ d% u ( z a> w b)\ < £ - 1=1 

The function djj \, can be extended to a uniformly continuous function on the closed disk, by using the 
following lemma, proved in the Appendix. 

Lemma 3.8. Let {(z kl ^&)}fc>i C T> xT> be a sequence that converges, in the Euclidean norm, to some point 
in (z' , w') E D x r D\D x D, that is \z k — z'\ + \wk — w'\ — > 0, as k — > oo. Then, \im.k^oo d^(zk, w k ) exists 
and depends only on the limit point [z',w'). 
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3.2. Incorporating d R (z,w) into the transportation framework. The next step in constructing the 
distance operator between surfaces is to incorporate the distance d R „(z, w) defined in the previous subsection 
into the (generalized) Kantorovich transportation model: 

(3.8) T?(ji,u)= inf f d R ly (z,w)dn(z,w). 

The main result is that this procedure (under some extra conditions) furnishes a metric between (disk-type) 
surfaces. 

Theorem 3.9. There exists ir* S n(/i, v) such that 

d R „(z,w)dir* (z,w) — inf / d R (z,w)dn(z,w). 



Proof. With a slight abuse of notation, we denote by fi the probability measure on V that is absolutely 
continuous with respect to th e hyperbolic measure on T>, with density function equal to the continous function 



(i on V. (See subsection 2.1 ) We now define a probability measure ponD by setting ~p(A) = ^(AnV), for 
arbitrary Borel sets A c T> ; V is defined analogously. By the Riesz-Markov theorem, the space of probability 
measures V{T> x T>) can be viewed as a (closed) subset of the unit ball in C(D X D)*. As such, both 
■p(2? x T>) and its closed subset H(ji,V) are weak*-compact, by the Banach-Alaoglu theorem. Note that for 
each 7f G n(/I, V), we have 

7f(X> xV)> W(V xV)- (W(V x[V\V})+ W([V \ V] x V)) = 1 - 17(2? \ D) - jz(P \ D) = 1 , 

and thus 7f(2? x T>) = 1; the restriction it of each such 7f to the Borel sets contained in T> x T> is thus a 
probability measure on T> x 2?. 



3.8 



it follows that the 



Since (the extension to T> x I? of) d MlJ ,(-, •) is an element of C(I? x I?) by Lemma 
evaluation 7f H> if{d^^) = Jp x p d fitl ,(z,w) dlf(z,w) — J VxT> d^^{z, w) dir(z, w) is weak*-continuous on 
the weak*-compact set Tl(~p,V); it thus achieves its infimum in an element tt* of that set. As observed above, 
7r*, the restriction of tt* to the Borel sets contained in V x T>, is a probability measure on P x D, and an 
element of n(/i, is); this is the desired minimizer. 

□ 



Under rather mild conditions, the "standard" Kantorovich transportation (2.7) on a metric spaces (X, d) 
defines a metric on the space of probability measures on X . We will prove that our generalization defines 
a distance metric as well. More precisely, we shall prove first that 

d R {MM)=T^u) 

defines a semi-metric in the set of all disk-type surfaces. We shall restrict ourselves to surfaces that are 
sufficiently smooth to allow uniformization, so that they can be globally and conformally parametrized over 
the hyperbolic disk. Under some extra assumptions, we will prove that d R is a metric, in the sense that 
d R (Ai, J\f) = implies that M. and M are isometric. 

For the semi-metric part we will again adapt a proof given in [?] to our framework. In particular, we shall 
make use of the following "gluing lemma" : 

Lemma 3.10. Let ^1,^2,^3 be three probability measures on T>, and let ~k\2 € n(/ii, ^2), 7^3 G n(/i2)A t 3) 
be two transportation plans. Then there exists a probability measure tx on T> x T> x T> that has ^12^23 ds 
marginals, that is J Z3eV dTr(z 1 , z 2 , z 3 ) = dn^izx, z 2 ) , and f zieV dir(z 1 ,Z2,z 3 ) — dir 2 3(z2, z 3 ). 

This lemma will be used in the proof of the following: 

Theorem 3.11. For two disk-type surfaces M — (T>,[i), J\f — (D,v), let d R (M,N) be defined by 

d R {M,M) = T R {»,v). 
Then d R defines a semi-metric on the space of disk-type surfaces. 
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Proof. The symmetry of <ff v implies symmetry for Tj*, by the following argument: 

T d(^ l/ )= inf ,/ d*(z,w)dn(z,w) = inf / d*(w,z)dn(z,w) 



= inf / d^' (w,z)dTr(w 1 z), where we have set 7r(u>, z) = 7r(z, uj) 
= T,f (z/, /Lt) . ( use that n £ v)«?e II(^, /x)) 

The non-negativity of d^(-, •) automatically implies T^(fi, v) > 0. 

Next we show that, for any Mobius transformation m, T^(^l, m*(J,) = 0. To see this, pick the transportation 
plan 7r £ n(/i,mH,/i) defined by 

f(z,w)dTr(z,w)= / f(z,m(z))fj,(z)dvolH(z). 
VxV ' Jv 

On the one hand it £ n(/x, m*/it), since 

d7r(z,w)= / /Lt(^)dvolij-(z), 
AxX> Ja 

and 

d7r(2:,w)= / XB{w)dir(z,w) 

PxL! J VxV 

XB[m(z))n(z)dvol H {z) = / XB(w)n*(w)dvol H (w) , 

Jv 

where we used the change of variables w = m(z) in the last step. Furthermore, tt(z, w) is concentrated on 
the graph of m, i.e. on {(z,m(z)) ; z £ T>} C T> x T>. Since d^ mm „(z,m(z)) = for all z e 2? we obtain 
therefore T d (n, m*/x) < / CxI , d^ mrfl (z, w)di:{z, w) = 0. 

Finally, we prove the triangle inequality T d H (fii, /13) < T^(ni, fi 2 ) + T R (/i 2 , /U3) ■ To this end we follow the 
argument in the proof given in 0] (page 208). This is where we invoke the gluing Lemma stated above. 



We start by picking arbitrary transportation plans tt\2 £ n(/ii,//2) and 7123 £ 11(^2, /13). By Lemma 3.10 

) x D x T> with marginals 7 

dw{z 1 ,z 2 , 23) = dw i3 (zi,z 3 ) 



there exists a probability measure tt on V x 2? x 2? with marginals 7ri2 and 7^3. Denote by 7Ti3 its third 
marginal, that is 



Then 



z 2 £V 



T d (jn,fi a )< / d^^zi, z 3 )dwi 3 (zi, z 3 ) = / ^3)^(^1, 22, 23) 

JVxV JVxVxV 



< 



< 



< 



VxVxV 



(d* ufl2 (zi, z 2 ) + d* aili3 (z 2 , z 3 )) dir(zi, z 2 , z 3 ) 



d^ 1 ^ 2 {zi,z 2 )dn(z 1 ,z 2 ,z 3 ) + / d R 2 ^(z 2 , z 3 )dTr(z 1 , z 2 , z 3 ) 

VxVxV JVxVxV 

d^ 1 p 2 (zi,Z2)dTT 1 2(z 1 ,Z2) + / d^ 24l3 (z 2 , z 3 )dTr 23 (z 2 , z 3 ) , 
VxV JVxV 



where we used the triangle-inequality for d3„ listed in (Theorem 3.3). Since we can choose 7Ti2 and ir 23 to 
achieve arbitrary close values to the infimum in eq. (3.8) the triangle inequality follows. □ 

To qualify as a metric rather than a semi-metric, d R (or T^) should be able to distinguish from each other 
any two surfaces (or measures) that are not "identical" , that is isometric. To prove that they can do so, 
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we need an extra assumption: we shall require that the surfaces we consider have no self-isometries. More 
precisely, we require that each surface M. that we consider satisfies the following definition: 

Definition 3.12. A disk-type surface Ai is said to be singly p- H fittable (where g G R, g > 0) if, for all 
R > g, all zq G T>, and all conformal factors obtained in uniformizations of the disk M. there is no other 
Mobius transformation m other than the identity for which 

/ \[i(z) — [i(m(z))\ cIvoIh(z) = ■ 



Remark 3.13. This definition can also be read as follows: M. is singly g- H fittable if and only if, for all R > g, 
any two conformal factors /ii and /12 for M. satisfy: 

(1) For all z 6 T> there exists a unique minimum to the function w i— > d R ± „ 2 (z, w). 

(2) For all pairs (z,w) G T> xT> that achieve this minimum there exists a unique Mobius transformation 
for which the integral in (3.5) vanishes (with /ii in the role of fj,, and fi2 in that of v). 

Note that in order to ensure that the conditions in the definition hold for all conformal factors, it is sufficient 
to require that it holds for the conformal factor associated to just one uniformization. 

Essentially, this definition requires that, from some sufficiently large (hyperbolic) scale onwards, there are 
no isometric pieces within (T>,[i) (or (2?, v)). 

We are ready to prove the last remaining part of the main result of this subsection. We start with a lemma. 

Lemma 3.14. Let it G n(/i, v) be such that /p xX , d R v (z,w) dir(z,w) = 0. Then, for all zq G T> and S > 0, 
there exists at least one point z G £l Zo ,6 such that dj} l/ (z,w) = for some w eD. 

Proof. By contradiction: assume that there exists a disk Zo ,« such that d^ [/ (z,w) > for all z G Q Za ,s and 
all w G T>. Since 



dir(z,w) = I n(z) dvoln (z) > , 

Q.(z .S)xV Jn(z ,S) 

the set £1(zq,5) x T> contains some of the support of tt. It follows that 



/ 

Jnt 



IQ(z ,S)xT> 

which contradicts 



d^ v (z,w)dn(z, w) < / d% v (z,w)dn(z,w) = 

r2(z ,*)xX> JVxV 



□ 



Theorem 3.15. Suppose that A4 and M are two surfaces that are singly g- H fittable. If d R (A4, J\f) = 
for some R > g, then there exists a Mobius transformation m G Md that is a global isometry between 
M = (T>,ii) and TV = (T>,v) (where fi and v are conformal factors of M. and M , respectively). 

Proof. When d R (M, Af) = 0, there exists (see 0]) n G v) such that 



VxV 



d R „(z,w)dir(z 1 w) = 0. 



Next, pick an arbitrary point zq G D such that, for some wq G T>, we have d R v (zo, ivq) = 0. (The existence 
of such a pair is guaranteed by Lemma 3.14 ) This implies that there exists a unique Mobius transformation 
mo G Md that takes zq to wq and that satisfies !/(mo(z)) = fJ<(z) for all z G £l Zo ,R- We define 

P* = sup{p ; d^ v (zo, w ) = 0}; 
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FIGURE 1. Illustration of the 
proof of Theorem |3.15| 




clearly p* > R. The theorem will be proved if we show that p* = oo. We shall do this by contradiction. 



Assume p* < oo. Consider f2 ZOlP *, the hyperbolic disk around z of radius p* . (See Figure[T]for illustration.) 
Set e = (R — q)/2, and consider the points on the hyperbolic circle C — dQ Zo , p »- e - E . For every z\ £ C, 



consider the hyperbolic disk Q Zl . £ /2\ by Lemma 3.14 there exists a point z 2 in this disk and a corresponding 
point W2 £ T> such that djj ^(22,11)2) — 0, i.e. such that 



\p(z) — m!*v(z)\ cLvo\h(z) = 



for some Mobius transformation to' that maps zi to W2', in particular, we have that 
(3.9) p{z) = v(m'{z)) for all z £ Z2ifl . 

The hyperbolic distance from z^ to dVt Zo , p - is at least g + e/2. It follows that the hyperbolic disk ^ Z2ie + £ /4 
is completely contained in Q Zo p *; since p(z) = is(mo(z)) for all z £ £l Zop *, this must therefore hold, in 
particular, for all z £ ^ z ,, e + e /4- Since ^ Z2ie + e /4 C Q Z2 ,r, we also have fi(z) — v(m'{z)) for all z £ ^ Z2 , e + e /4, 
by (3.9). This implies v(w) = v(mo o (m')^ 1 (w)) for all w £ ^l W2 ,g+e/i- Because N is singly g- H httable, it 
follows that mo o (to') -1 must be the identity, or too = to'. Combining this with (3.9), we have thus shown 
that p,(z) — i/(m Q (z)) for all z £ £1 Z2 .r. 

Since the distance between Z2 and Z\ is at most e/2, we also have 

Q Z2 ,R 3 Q Zll R-s/2 = ^ Zl ,e+3e/2 ■ 



This implies that if we select such a point z 2 (zi) for each z\ £ C, then ^ 2o , p *- e - £ U ( U Zl£ c Q Z2 ( Zi ).r) covers 
the open disk £\ 0iP *+ £ /2- By our earlier argument, p{z) = is(m (z)) for all z in each of the Q Z2 ( Zi ).r] since 
the same is true on f2 ZOiP *_ e _ e , it follows that /i(z) = v(mo(z)) for all z in ^ Z0! p»+ £ /2- This contradicts the 
definition of p* as the supremum of all radii for which this was true; it follows that our initial assumption, 
that p* is finite, cannot be true, completing the proof. □ 



For (2?, /i) to be singly g- H fittable, no two hyperbolic disks f2 Z) _R, £l w ,R (where w can equal z) can be isometric 
via a Mobius transformation ro, if R > g, except if to = Id. However, if z is close (in the Euclidean sense) 
to the boundary of T>, the hyperbolic disk fl Zi R is very small in the Euclidean sense, and corresponds to 
a very small piece (near the boundary) of M. This means that single g- H fittability imposes restrictions in 
increasingly small scales near the boundary of Ai] from a practical point of view, this is hard to check, and in 
many applications, the behavior of Ai close to its boundary is irrelevant. For this reason, we also formulate 
the following relaxation of the results above. 

Definition 3.16. A surface M. is said to be singly A- M fittable (where A > 0) if there are no patches (i.e. 
open, path-connected sets) in M of area larger than A that are isometric, with respect to the metric on Ai. 
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If a surface is singly ^4-^fittable, then it is obviously also A'-^fittable for all A' > A; the condition of 
being A-^fittable becomes more restrictive as A decreases. The following theorem states that two singly 
A-^fittable surfaces at zero d^-distance from each other must necessarily be isometric, up to some small 
boundary layer. 

Theorem 3.17. Consider two surfaces M and M , with corresponding conformal factors \x and v on T>, and 
suppose d R (M,Af) = for some R > 0. Then the following holds: for arbitrarily large p > 0, there exist 
a Mobius transformation m G Mjj and a value A > such that if Ai and J\[ are singly A- M fi,ttable then 
p(m(z)) = v(z), for all z G flo, P - 



Proof. Part of the proof follows the same lines as for Theorem |3.15| We highlight here only the new elements 
needed for this proof. 



First, note that, for arbitrary r > and zq G T>, 

(3.10) vol M (tt Zo , r ) = / fi(z)dvol H (z) > vo\ H (n zo . r ) 

Jn Z(J , r 

This motivates the definition of the sets Oa,t, 

(3.11) A>T = \zeV 



min li(z) 



volff(J2o,i 



min p{z) 



min u(z') > r 

z'efi*,r vol^(0,f2o,r) 

A > is still arbitrary at this point; its value will be set below. 

Now pick r < R, and set e = (R— r)/2. Note that if z G 0a, r, then yo\m{^z,r) > volx (O z , r ) > A. 

Since fi is bounded below by a strictly positive constant on each Jlo,p'j we can pick, for arbitrarily large p, 
A > such that f2 ,p <^ 0A,r! f° r this it suffices that A exceed a threshold depending on p and r. (Since 
p(z) —> as z approaches the boundary of V in Euclidean norm, we expect this threshold to tend towards 
as p — > oo.) We assume that £Iq. p C 0a, r in what follows. 



Similar to the proof of Theorem 3.15 we invoke Lemma 3.14 to infer the existence of Zq,wo such that 
zq e fio,e/2 and d^(zo,Wo) = 0. We denote 

p* = sup{r' ; d£ v (zo,wo) = 0}; 

as before, there exists a Mobius transformation m such that v(m(z)) = p(z) for all z in f2 Zo P ». To complete 
our proof it therefore suffices to show that p* > p + e/2, since f2o,p C ^ Zo ,p+ e /2 • 



Suppose the opposite is true, i.e. p* < p + e/2. By the same arguments as in the proof of Theorem 3.15 
there exists, for each z\ G 9f2^ 0>jO »_ r _ e , a point 22 G &xi,e/2 such that d R u {z2 : ui2) = for some 102- Since 
the hyperbolic distance between z 2 and is bounded above by e/2 + p* — r — e + e/2 < p — r + e/2 < p, 
Z2 G flo, P C Oa.t, so that voLvf (Q Z2 ,r) > A. It then follows from the conditions on M. and M that v(m(z)) — 
p{z) for all z in £l Zo ,p* UJ7 Z2j ^ D ^z ,p* ^^zi,r+3e/2- Repeating the argument for all z\ G dQ ZOiP *- r - £ shows 
that v(m{z)) = /i(z) can be extended to all z G ^z ,p*+ £ /2, leading to a contradiction that completes the 
proof. □ 



So far, we have dealt exclusively with disk-type surfaces. The approach presented here can also be used for 
other surfaces, however. In order to apply this approach to sphere-type (genus zero) surfaces, for instance, 
we would need to change only one component in the construction, namely how to define the neighborhoods 
^z ,R i n a Mobius- invariant way. Since there exists no Mobius-invariant distance function on the sphere, 
we can not define the neighborhoods £1 Zq .r as disks with respect to such an invariant distance. We thus 
need a different criterium to pick, among all the circles centered at a point zq € M, the one circle that shall 
delimit £1 Zo .r- Since the family of circles is invariant under (general) Mobius transformations, it suffices to 
pick a criterium that is itself invariant as well. For our applications, we pick Q Zo ,a to be the interior of the 
circle around z that has the smallest circumference among all such circles with area (or volume) A (where 
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A e (0, 1) ). In the generic case this procedure defines a unique neighborhood that can be used in the same 
way as ^ Zo ,_r up till now. 

For higher genus (but still homeomorphic) surfaces A4, Af it is natural to use the universal coverings A4, Af, 
respectively. For genus greater than one, we could use again the disk-type construction. The fix here (to 
avoid infinite volume of the flattened surface via the universal covering) could be to restrict the compared 
zq G T> to one copy of AA in the hyperbolic disk (and similarly wq G T> in Af). Treating the genus one case 
can be done similarly with similarity transformations in C. 



4. Discretization and implementation 



Several steps are needed to transform the theoretical framework of the preceding sections into an algorithm, 
as described in detail in [5]. In a nutshell, the procedure requires three approximation steps: 1) approximating 
the smooth surfaces with a discrete mesh, 2) using discrete conformal theory to construct a discrete analog 
of uniformization for meshes, and 3) reducing the discrete optimization problem (resulting from replacing 



/i,z/ in eq. (3.8) by their discrete versions supported on a finite set of points) to a linear program. If an equal 
number of discrete point masses is chosen for the discrete measure on each of the two surfaces, and all of them 
are given equal weight, the corresponding search for the optimal bistochastic matrix automatically produces 
a minimizer that is a permutation. This means that the minimizer defines a map from (the discretized 
version of) one surface to (the discretized version of) the other. 

It follows that the surface distance given in this paper does indeed lead to a computationally efficient 
approach, both for finding the best similarity distance and for identifying the best correspondence between 
two (disk-type) surfaces. Figure [2] shows an example of a discrete surface and the corresponding approximate 
conformal density visualized as a graph over the unit disk. 

Efficient computation of a distance between surfaces is important for many applications. As an example, 
Figure [3] shows an application of our approach to the characterization of mammals by the surfaces of their 
molars [?] , comparing high resolution scans of the masticating surfaces of molars of several lemurs (small 
primates living in Madagascar). The figure shows an embedding of eight molars, coming from individuals 
in four different species (indicated by color). The embedding is based on the pairwise distance matrix 
(d R (A4i, A4j)), and it clearly agrees with the clustering by species, as communicated to us by the biologists 
from whom we obtained the data sets. 
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Figure 3. Embedding of the distance graph of eight teeth models using multi-dimensional 
scaling. Different colors represent different lemur species. The graph suggests that the 
geometry of the teeth might suffice to classify species. 
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Appendix A. 

This Appendix contains some technical proofs of Lemmas and Theorems stated in section [3j and [4] We start 



by proving that (3.1) docs indeed define a distance metric on the family of JL := {m*fi; m £ M-p }, where 
the /i are (smooth) conformal factors on T), as obtained in sections 2 and 3. We first state a more general 
lemma: 

Lemma A.l. Let X, d be a metric space, G a group, and T : g i— > T g a representation of G into the 
isometries of X, d; in particular, d is invariant under the action of the group G, i.e. d(T g x,TQy) — d(x,y), 
for all x, y £ X and all g £ G. Define C to be the collection of orbits of the representation of G, i.e. 
the elements of C take of the form {T g x; g £ G}, for some x £ X. Define d on C x C by d(cx,c 2 ) = 
infa; iecii X2 <£c 2 d{x\,x 2 ). Then d defines a semi-metric on C. 

Proof. It is obvious that d(ci,c 2 ) > for all Ci, c 2 in C; thus only the triangle inequality needs to be 
established. 

Since an element c\ of C can always be written as c\ — {T g x; g £ G}, where x is an arbitrary element of G, 
we obtain, for arbitrary c\, c 2 , c 3 in C, 

d(cx,c 3 ) = inf d(T g x,T g i z) where x £ c±, z £ C3 are arbitrary 
9,3'eG 

< d(T gi x,T ga z) for all g x , g 3 £ G 

< d(T gi x,T g2 y) + d(T g2 y,T g3 z) for all gx, g 2 , 53 £ G and all y £ X 
= d(T gi x,T g2 y) + d(T g > 2 y,T g , 2 (g 2 )-i g3 z) for all gx, g 2 , g' 2 , 93 € G and all y £ X. 

When g' 2 , g 2 m G are kept fixed, the group elements g' 2 (g 2 )~ 1 g3 run through all of G as 53 varies over G. By 
taking the infimum over the choices of gx, g 2 , g' 2 , 33 £ G in the last expression, we thus obtain 

d(cx,c 3 ) < d(a,c 2 ) + d(c 2 ,c 3 ) , 

where c 2 := {T g y ; g £ G}. Since y £ X is arbitrary, this proves the triangle inequality in C for all three-tuples 
in C. □ 

Note that one can use the invariance of d under the action of the group on X to define d(cx, c 2 ) via a single 
minimization (instead of two): for x £ c\, y £ c 2 , 

d(cx,c 2 ) = inf d(T g x,T gl y) = inf d(x,T q -i q ,y) = inf d(x,T g ,>y) . 
9,s'eG g,g'eG g"eG 



To apply this to (3.1 1, we choose X to be the set of nonnegative G 1 -functions on T> that have integral 1 with 
respect to the hyperbolic area measure on T>, and d the Kantorovich mass transport distance between them, 
with the "work" measured in terms of the hyperbolic distance metric d H on T>: 

d(fi,v) = inf / c? H (z, w) d^{z, w) . 
7ren(/i,i/) Jx>xv 
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The group G is here given by Mp, and the action of G on X by pull-back: T m (ji) = m*/i. To apply the 
lemma, we first need to establish that : 

Lemma A. 2. d(/x, v) — d(m*^t, m*v), for all \x, v in X, and all m in Afp. 

Proof. We first rewrite d(m*/i, m^v) in a different way. For each n £ n(/i, v), we define the probability 
measure m*7r on T> x T> by m*n(E) = ir ( { (m~ 1 z, m~ 1 w) ; (z,!»)e£}). It is straightforward to check that 
m»7r(Ax2?) = m^/^A) and m^nlT) x B) = m^viB) for all Borel sets A , B C I?; thus m*^ <E n(m >t ^, m^v). 
One can analogously define m*7r; again it is straightforward that m*m*Tt = tt. It follows that Tl^m^/i, 771*1/) 
is exactly equal to { m*7r ; 7r <E II(/z, i/) }. 

Consequently using the invariance d rt {mz 1 mw) = d K {z,w), we obtain 

d(m W )= inf / d «( Z , W ) d (m^ M = inf / ^(m.,™)^*) 
7ren(/i,i/) ,/p x p jren(/t,i/) ,/p x p 



inf / c? h (m, f ) c?7r(u, w) = d(/i, i/) 
"e n (f>") Jvxv 



□ 



It follows that we can indeed apply the first lemma, and that (3.1 1 defines a semi-metric on the equivalence 
classes of conformal factors, where two conformal factors are viewed as equivalent if one can be obtained 
from the other by pushing it forward (or backward) through a Mobius transformation. 

It turns out that in this case, the infimum over the choices m £ Afp is in fact always achieved (and is thus 
a minimum): 

Lemma A. 3. Let \x and v be conformal factors obtained by uniformizing two smooth disk-type surfaces, 



with Distance(fi, v) < oo defined as in (3.1 1. Then there exists a Mobius transformation m £ Afp such that 
Distance^, v) — d(m >t /i, v). 

Proof. Consider two arbitrary (but fixed) conformal factors [i and v on T>. There exists a sequence (w„) neN 
such that inf we n(/i,i/) Jp x p d n (m n (z),w) dir(z,w) —> Distance^, v) as n — > oo. Each of these m n can be 



written in the form given by (2.3), with corresponding a n £ V, and e " £ T := { z £ C ; \z\ = 1 }. By 
passing to a subsequence if necessary, we can assume, without loss of generality, that the sequences (a n ) neN 
and ( el6, ") ngN converge in V (the closure of V) and T, respectively, to limits we denote by a and e l6 . 

If a lies in the open disk T>, then it defines, together with e l9 , a corresponding m £ Af-p. We then have, for 
all z, w in T>, lim ra _ i . 00 rf H (m„(z), w) = d H (m(z), u>). On the other hand, for sufficiently large n we have 

d H {m n (z),w) < d H (m„(z),0) + d H (0,w) 

= d H (z,a n ) + d H (0,w) 

< d H (z,a) + 1 + d H (0,w) 

< d H (z,0) + rf H (0,a) + 1 + d H (0,w), 

where we have used the invariance of d H under Mobius transformations and m n (a n ) — in the second line, 
and where we assume n sufficiently large to ensure d H (a n ,d) < 1 in the third. 

Therefore d H (m n (z), w) is bounded, uniformly in n, by a function that is absolutely integrable with respect 
to 7r (by the argument used just before the statement of Lemma A. 2); the dominated convergence theorem 
then implies that 



TJ(/x, v) = lim / d ii (m n (z),w) dTr(z,w) 

n->oo J VxV 

d H (m(z), w) dir(z, w) , 



VxV 
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so that we are done for the case where a £ V. 

It remains to discuss the case where a £ T> \ T) = T, i.e. \a\ = 1. The proof will be complete if we show that 
this is impossible; we will establish this by contradiction. 

From now on, we suppose that \a\ = 1. By the integr ability of fx and v, we can find an increasing sequence 
of p n < 1 such that \x {{z ; 1 > \z\ > p n }) < 1/n and v ({z ; 1 > \z\ > p„ }) < 1/n . It is easy to check that 

R- p 



for \a\ > R and \z\ < p < R 



1 — a* z 



> 



1 - Rp 



inf 

\z\<p„ 



This lower bound tends to 1 as R tends to 1, regardless of the value of p < 1. It follows that there exist 
R„ < 1 so that 

Z — - (1 

— > (n + Pn)/(n + 1), for all a with \a\ > R n . 

1 — a* z 

Because \a\ — 1, we can find a k n £ N such that |ofe| > R n for all k > k n ; consequently |mfc(-z)| = 
\z — ctfe j / 1 1 — (<2fe)*z| > (n + /On)/ ( n + 1) f° r au k > k n and all z with \z\ < p n . It then follows that (with the 
notation V n ;= {z; \z\ < p n }) 



Vfc > k n , W £ II(/i, i/) : / d H (mk(z),w) d,Tr(z,w) 

JVxV 

d H (nik(z) , u>) d7r(z, 



> 



> 



> 



D„xD„ 



B„x2J r 

ln(n + 1) 



inf c? h (m,u) 

|i;|<p n ,|u|>(n+p„)/(n+l)) 



dTr(z, w) 



T>„ xX>„ 



> hl(n - 1} [1 - »r((I>\I> n ) x 2?) - ,(»x(D\D„)); 



> 



2 

ln(n + l) 



1 



where we have used that if \u\ < r < 1, and |i>| > (n + r)/(n + 1), then c? H (u,w) > J r 
i j(n+r)/(i+n) _j_ ^ _ i n (n+i) ^ shows, in particular, that 

d(/i,m^) = inf f d H (mk(z), w) dir(z, w) > — — 
7ren(/i,w) Jt>xv 4 

for all k > k n and n > 4. This implies that, for arbitrary n > 4, n G N, 



(n+r)/(l+n) 



l-t 2 



dt > 



Distance(/i, z/) = lim^oo d(/i, rn* k v) > 



ln(n + 1) 



i.e. Distance(/i, i/) = oo, a contradiction. This finishes the argument that \a\ = 1 is not possible, and 
completes the proof. □ 



It is now easy to see that Distance(/i, v) defines a true metric on the equivalence classes of conformal factors: 



Proposition A. 4. The Distance{p, v) defined in (3.1 1 is a metric on the set of orbits [i := {m*p, ; m 6 M-p } 
of conformal factors under the action of . 



Proof. In view of the first lemma, we need to prove only that Distance^, v) = implies that there exists a 
Mobius transformation m £ Mp such that v = m*/x. By the second lemma, we know that Distance(/i, v) = 
d(m + /i, v) for some m £ Mp. For this m there exist thus 7Tj, £ II(/i, v) such that B d H (m(z), w) diTf : (z, w) 
tends to as fc tends to oo. By passing to a subsequence if necessary, we can, using the weak*-compactness 
of the set of probability measures on T> x V, assume that ttj- — > W as k tends to infinity, in the weak*-topology, 
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where n is a measure of weight at most 1. Since the 7Tfc all have marginals fi and respectively, it follows 
that 7f must have these marginals as well, which guarantees that n is itself a probability measure and an 
element of n(/i, v). We have moreover 



= Distance^, v) = / e? H (m(z), w) dir(z, w) 

JVxV 



implying that the support of tt is contained in the subset { (z, m(z)) ; z E V} C V x T>. It then follows that, 
for any Borel set AcT>, 



H(z) dvo\ H (z) = -k(A x V) = tt(A x m(A)) = tt(V x m(A)) = I v(m(z)) dvo\ H (z) 
A J A 

This is possible for the continuous functions /x and v only if v = m* [i, or equivalently, /i = m*v 



□ 



3.3 



Next we prove the list of properties of the distance function u (z, w) given in Theorem 
Theorem 13.31 

The distance function d^^(z,w) satisfies the following properties 

(!) d ^ l , fim , :J (m^ 1 (z Q ),m 2 1 (w )) = d^ v (z Q ,w ) Invariance under (well-defined) 

Mobius changes of coordinates 

( 2 ) d ^( z o, wo) = d^^(wo,z ) Symmetry 

(3) d^ lJ (z n ,wo) > Non-negativity 

( 4 ) d^ v (z ,w ) = => Cl Zai R in (V,[i) and Sl Wo>R in (V,v) are isometric 

(5) d^ l , Vtl/ (m~ 1 (z ),z ) = Reflexivity 

(6) < 

,^ 3 ( z i; z 3) < dj} lf t 2 (zi, Z2) + d^ 2fjl3 (z2, Z3) Triangle inequality 
Proof. For (1), denote m 1 " 1 (^o) = z i, and m^ 1 (wo) = wi. Then 



dm',u, m tu( z ii w i) = i nf / \mX)i(z)-rn*rn* 2 v{z)\dvo\ H {z) 



= inf / \/j,(mi(z)) — v{m2{m{z)))\d\o\ H (z). 
m (z 1 )=w 1 Jn nH 

Next set fa = m 2 °m° mj" 1 . Note that m(zo) — ^o- Plugging ?ri2(m(z)) = m(mi(z)) into the integral and 
carrying out the change of variables mi (z) = z' , we obtain 



inf / I /Lt(z') - v(m(z')) \dvo\ H (z') = _ inf / | /z(z') - ^(m(z')) | dvol#(,2:') 

m(zi)=«>l Jn zi R m(z )=w Jn z0zR 



For (2), we use Lemma 3.2 and equations (2.5), (2.6) to write 



d v (z a ,w )= inf / \ i a{z)-m*v{z)\dvo\ H {z) 

m(z )=w Jn b 



inf / |(m 1 )*>(«>) - ^(w)|dvol ff (w) = d^ : (w ,z a ). 

m(z )=w jQ mQz Ji 
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(3) and (4) are immediate from the definition of d^ v . 



(5) follows from the observation that the minimizing m (in the definition (3.5) of dj} „) is toi itself, for which 
the integrand, and thus the whole integral vanishes identically. 

For (6), let TOx be a Mobius transformation such that m 1 (z 1 ) = z 2l and m 2 such that TO 2 (z2) = z 3 . Setting 
to = m 2 m i , we have 



^i,^^ 1 '^) < / \n 1 (z)-m*n 3 (z)\dvo\ H (z) 



(A.l) < / I nx(z) - mlfi 2 (z) I dvoljj(^) + / \m* 1 n 2 (z) ~ m* fi 3 (z)\dvo\ H (z) . 



The second term in (A.l I can be rewritten as (using Lemma 3.2 the change of coordinates mi(z\) = z 2 and 



the observation to* — mjmj) 



j m[u 2 (z) — m* u 3 (z) I dvo\n{z) = / | mi^m![n 2 {w) — mi*mlm 2 ii 3 (w) | dvoljy(w) 

I /i 2 (w) - m* 2 fi 3 (w) I dvol H (w). 



We have thus 



d^^izi, Z3) < / \ui{z)-m* l u 2 {z)\dvo\ H (z)+ I \ fi 2 {w) - m 2 u 3 (w) \ dvol H (w) , 

and this for any mi, to.2 £ Mp such that mi(zi) = z 2 and m 2 (z 2 ) = z 3 . Minimizing over toi and TO2 then 
leads to the desired result. 

□ 

Next we prove the continuity properties of the function $>(zq, wq, a) — j^, R s | /i(z)— v(m ZOiWOir7 (z)) \ dvolfj{z) 



3.6 



which were used to prove continuity of d^ u itself (in Theorem 3.7). 



stated in Lemma „ , ,, , 

Lemma 13.61 

• For each fixed (zq, wq) the function §(zp, wq, ■) is continuous on S±. 

• For each fixed a £ S\, <&(-, -, a) is continuous on V x V. Moreover, the family ( $(■, ■, a) 1 is equicontinuous. 

Proof. We start with the continuity in a. We have 

I <J>(z , w Q: a) - $(z ,w ,a') \ < / \v(m ZOjWOj(T (z)) - v(m ZOtWo y(z))\dvol H (z) . 

Jn(z ,R) 



Because v is continuous on V, its restriction to the compact set £1{wq, R) (the closure of £l(wo, R)) is bounded. 
Since the hyperbolic volume of Q(zq, R) is finite, the integrand is dominated, uniformly in cr', by an integrable 
function. Since m ZQ ^ WQ ^(z) is obviously continuous in a, we can use the dominated convergence theorem to 
conclude. 



Since S 1 is compact, this continuity implies that the infimum in the definition of dff u can be replaced by a 
minimum: 

d^ v (z ,w )= min / \fj,(z)-v(m(z))\dvol H (z). 



m(z )=w Jn(zoM) 

Next we prove continuity in zq and w (with estimates that are uniform in a 
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Consider two pairs of points, (zq,Wo) and (z' ,Wq) G T> x T>. Then 
mz ,w ,a) -${z' q ,w' ,ct)\ 



| pb{z) - v{m ZOiWCha (z)) I dvo\ H (z) - I I n(u) - p(m z ' otW ' o(7 (u) | dvo\ H (u) 

O(z ,-R) Jn(z' ,R) 



< 



< 



I fj,(z) - v(m ZOiWO}Cr (z)) | dvol^(z) - / | n(m Zo>z ' ot iz) - v(m z > 0jW > ^ o m ZO;Z > o ,i{z)) \ dvol H (u) 

O(2 0l -R) Jn(z ,R) 

( I fJ-(z) - n{m Zo y oA (z)) | + | u(m ZOjWo ^(z)) - v{m z[vW[va {m z ^ z , (yl {z))) \ ) dvo\ H {z) . 

Q(z ,R) 



On the other hand, note that for any 7 > 0, p. and v are continuous on the closures of Q(zq,R + 7) and 
fl(wo, R + 7), respectively; since these closed hyperbolic disks are compact, p and v are bounded on these 
sets. Pick now p > such that \z' — z \ < p, \w' Q — wo\ < p imply that Q(z' Q , R) C r2(z , ^ + 7) as weu as 
f2(u>Q, R) C ^(wq, R + 7). It follows that, if \z' — z Q \ < p and \w' — Wq\ < p, then | p(z) — p{m ZQZ ^i{z) | and 
I v { m z 00 ,w Q ,o{z)) — v(m z (^ w ' n ^ (m Zo ^ z ^i(z))) | are bounded uniformly for z € Q(zo,R). Since it is clear from 
the explicit expressions (3.7) that m ZOtZ > ,1(2) — > 2 and m z i w > . a (m ZaZ i .1(2:)) — > m 2o . lUo . (T (z) as Zg — > zo and 
w — > wo, we can thus invoke the dominated convergence theorem again to prove continuity of $(•, -,cr). 

To prove the equicontinuity, we first note that v is uniformly continuous on f2(u>o,-R) U f2(u>Q,i?), since v 
is continuous on the compact set Q(wo, R + 7), which contains Q(wo,R) U f2(w ,i?) for all w that satisfy 
|w — u>o| < p. This means that, given any e > 0, we can find 5 > such that \v(w) — f(w/)| < e holds for 
all w, w' that satisfy u>, w' € Q(wq, R) U Q(w' , R) and |w — w'\ < 5. This implies the desired equicontinuity 
if we can show that \m ZoWo(7 (z) — rn z ^ w r arT (m ZgZ r 1 (z))\ can be made smaller than 5, uniformly in a € Si, 
by making \z' — zq \ + \w' — Wq\ sufficiently small. 

We first estimate \m Zo>WOjCr (z) — m ZOtW ' t<T (z)\. With the notations of (3.7), we have 

, (z - w a)(l - Z^w' a) - (z - w' Q a){l - z^u> ct) 

a{z ,w Q ,a) - a(z ,w ,a) = T _ " 

(1 - z wo(J){l - z w' a) 

= (wg ~ w )a(|z | 2 - 1) 
(1 - z^w a)(l - zow' a) 



so that 



1 / \ / / m \ w o — w'o\ £ 

\a(z ,W ,a) - a{Z , W ,a)\ < — ; n p-r- ; 7- 1 , Tyj < JZ j rj TVlTj j 17| i , 7TT 

(1 - \z \ \wo\)[l - \z \(\w \ + Ol (1 - \zo\ \w \)[l - \zq\{\w \ + 0] 



when \wq — w' \ < £. It thus suffices to choose ^ so that £ < C(l — \zo\ \wq\)[1 — \z \(\wo\ + £)] to ensure that 
3 ,w ,cr) - a(z ,w' ,a)\ < £ 

T(z ,w Q ,a) - t(z ,w' ,<j) 



\a(zo,wo,a) — a(zo, w' , a)\ < (. For the phase factor r in (3.7) we obtain 

(1 - zow' a)(l - zqWoct) - (1 - zow a)(l - z Q w' a cr) 



(1 - z w a)(l - z w' a) 

_ (WQ - WqJZqOj- (WQ 7 - w' a )z <7 + |z | 2 (wqW - WqWq) 

(1 - z "w a : )(l - z^w' a) 

(W - W n )zp^~ Zq(wq - W' Q )<7 + |z | 2 [W(^(w n - Wp) + Wq(wq - w' Q ]) 

(1 - z^w a)(l - z^w' a) 

when \wo — w' \ < this implies 

, / x / xi l^olKI[2+|zo|(2K|+0] 

T(Z , u; 0) Ct) - r(z , W , cr) < — n r-r- —r — , 

(1 - \z Q \ \w \)[l - \z Q \{\w \ +01 
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which can clearly be made smaller than any £ > by choosing £ sufficiently small. All this implies that (use 



(3.7)) 



m 2 



l + UI) 2 

2 



rO) - m Zo>w ' Qi<T {z)\ < \t(z ,w ,(t) - T(z ,w' ,a)\ - - + \a(z , w ,cr) - a(z ,w' ,<r)\ - 
2(1 + N) 



- C (i-W) 2 ' 

which will be smaller than 8/2, uniformly in a, if £ < 8(1 — |z| 2 )/8; this bound on £ in turn determines 
the bound to be imposed on the £ used above. Hence \m ZOtWOt(7 (z) — m ZOtW ' gtCr (z)\ < 8/2 can be guaranteed, 
uniformly in a, by choosing \wq — w' Q \ < £ for sufficiently small £. 
One can estimate likewise 

\m ZotW > ottT (z) - m z > otVJ ' ot(r (m x ' 0jZot i{z))\ , 
and show that this too can be made smaller than 8/2, uniformly in a, by imposing sufficiently tight bounds 
on \z' Q — Zq\ and \w' Q — Wq\. Combining all these estimates then leads to the desired equicontinuity, as indicated 
earlier. □ 



To prove Lemma |3.8[ we shall use the following lemma: 



Lemma A. 5. Consider u k — e 1 ^ + e k , where \e k \ — > as k —> oo. Then there exists, for every e > 0, a 
K G IN such that for all k > K ; and all fh € Mr)fl,u k > 

inf |m(to)| > 1 — £. 

w£fl ,R 



The set Mrjfi. Uk used in this lemma is given by Definition 3.4 

we can write fh as 



Proof. From Lemma 3.5 



fh(w) — e 



■ ie w + u k e 



1 + u k e l6 w ' 



for some 9 £ [0, 2n). Substituting Uk — e 1 * + e k in this equation we get 

~, v ie w + (e^ +e k )e- ie ^ 1 + we'^ 9 ^ + e fce -^ 

m (w) = e = e v rj- — -. - n — . 

1 + ( e iV + e k )e ie w 1 + we 1 ' 9 "* + £^e ie w 

Writing the shorthand s for s = 1 have thus 



m(w) — e 



s + Wke ie w 



< 



M s k e-^ -s k e ie w 



< 



\s + e k e lS w\ 



< 



s + ek.e ie w 

M(i + M) 



Now for all w € CIq,r, \w\ < r R = tanh (R). This implies |s| > 1 — \w\ > 1 — r R , and 1 + |w| < 1 + r^, so 
that 

, _ , 1 + r R , _ , 1 + r fl 



m(w)-e^ < |e fc | 



1 — r ji — ]efc|r^ 1 - r R (1 + \e k 



Since |efc| — > the lemma follows. 
We are now ready for 



□ 



Lemma 3.8 Let {(z k , w k )} h> ^ C V x V be a sequence that converges, in the Euclidean norm, to some point 



(z' , w') £ T> x T> \ T> x T>, that is \z k — z'\ + \w k — w'\ — » 0, as k — >■ oo. Then, lim^oo df^ r (z k , w k ) exists and 
depends only on the limit point (z' ,w'). 
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Proof. Since (Y, «i')eDxD\DxD either z' £ V \ P or w' £ P \ P. Let us assume that z' e P \ P (the 
case u>' G P \ P is similar). Denote by m k an arbitrary Mobius transformation in -Mr>.o,™ fc - By symmetry of 
the distance and using a change of variables we then obtain 

df c (z k ,w k )= df t {w k ,z k ) 



mm 

m(w k )=z k Ja 



£(w) -£(m(w)) 



(ivoljj(w) 



mm 

rn(w k )=z k Jq r 



C(m k (w)) - £{m(m k (w))) dvo\ H {w). 



Now, recall that £(z) = £ (z) = £,(z)(l — \z\ 2 ) 2 , where £(z) is a bounded function, sup zgI > \£,(z)\ < Cg. From 
Lemma A. 5 we know that for every e > and for k > K sufficiently large, \m(m k ((w))\ > 1 — s for all 
w £ f^o,i?i and all m such that m{w k ) = z k . This means that for these k > K we have 

\£(m(m k (w)))\ = £(m(m k (w))) (1 - \m(m k (w))\ 2 ) 2 



< Cr(l - (1 - e) 2 ) 2 < C7 f e 2 (2 - e) 2 



for all w £ £Iq,r- Therefore 

df c (z k ,w k ) - 



< 



< 



mm 

m(w k )=z k Jq q r 



mm 

i(w k )=z k J Qq R 



< min 

m(w k )=z k Jq 



C(m k (w)) dvol H (w) 

C(m k (w)) - ^(m(m k (w))) dwo\ H (w) - 

| C(m k {w)) - £(m(m k (w))) - ((m,k{w)) } dvol H (w) 
£(m(m k (w))) dvolniw) — > 0, as k — > oo. 



£(m k (w)) dvol H (w) 



Therefore d^ ( -(z k ,w k ) converges, as k — > oo, if and only if J n \£(mk(w))\ dvoln (w) converges, and to the 
same limit, for any m k £ Mo,Q_ Wk . We can take, for instance, m k (w) = pji=~ which gives 

w + w k 



\C(m k (w))\dvol H (w) 



dvolniw). 



1 + w k w / 

For w £ ^o.-Rj |1 + Wkw\ > 1 — tr. It follows that this expression has a limit as k — > oo, and 

w + w' 



lim 



\((m k (w))\ dvol H (w) = 

Ft 

which clearly depends on w' , not on the sequence {w k } 



1 + w'w 



dvolffiw), 



a 
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